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Any hint of non-gaussianity in the cosmological initial conditions will provide us with a unique 
window into the physics of early universe. We show that the impact of a small local primordial 
non-gaussianity (generated on super-horizon scales) on the statistics of collapsed objects (such as 
galaxies or clusters) can be approximated by using slightly modified, but gaussian, initial conditions, 
which we describe through simple analytic expressions. Given that numerical simulations with 
gaussian initial conditions are relatively well-studied, this equivalence provides us with a simple 
tool to predict signatures of primordial non-gaussianity in the statistics of collapsed objects. In 
particular, we describe the predictions for non-eaussian mass function, and also confirm the recent 
discovery of a non-local bias on large scales jj], , as a signature of primordial non-gaussianity. We 
then study the potential of galaxy surveys to constrain non-gaussianity using their auto-correlation 
and cross-correlation with the CMB (due to the Integrated Sachs- Wolfe effect), as a function of 
survey characteristics, and predict that they will eventually yield an accuracy of A/jvl ~ 0.1 and 3 
respectively, which will be better than or competitive with (but independent of) the best predicted 
constraints from the CMB. Interestingly, the cross-correlation of CMB and NVSS galaxy survey 
already shows a hint of a large local primordial non-gaussianity: /ml = 236 ± 127. 
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I. INTRODUCTION 

The standard cosmological model has been tremen- 
dously successful in explaining a large range of cosmolog- 
ical observations, ranging from cosmic microwave back- 
ground (CMB) anisotropics, to the galaxy redshift sur- 
veys, supernovae la, baryonic acoustic oscillations and 
the Lyman-cv forest in the quasar spectra. However, our 
clues into the nature of the initial conditions of the Uni- 
verse, as well as its underlying physics, remain scarce. 

The theory of cosmological slow-roll inflation remains 
the most successful and widely studied model for the cos- 
mological initial conditions. However, despite the large 
inflow of cosmological observations over the past few 
years, it remains practically impossible to distinguish a 
large class of inflationary scenarios. Moreover, given the 
limited amount of observables accessible to a today's ob- 
server, it is not clear whether any possible future obser- 
vations will be able to falsify the inflationary scenario. 
One exception to this dilemma would be a possible de- 
tection of non-gaussianity of cosmological initial condi- 
tions, which is predicted to be unobservably small in 
standard slow-roll inflationary scenarios (e.g. While 
a large primordial non-gaussianity can still be accommo- 
dated in a general inflationary scenario through intro- 
ducing fine-tuned secondary light fields, higher deriva- 
tive interactions, or non-adiabatic initial conditions, it 
could also be interpreted as a smoking gun for alterna- 
tive early universe models, such as the new ekpyrotic 
scenario Q], which naturally predicts a large primordial 
non-gaussianity pj. 
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The best current observational constraints on the pri- 
mordial non-gaussianity comes from the statistics of the 
all-sky CMB maps surveyed by the Wilkinson Microwave 
Anisotropy Probe (WMAP). Using the 3 year data re- 
lease of the WMAP CMB maps, Yadav and Wandelt @ 
claimed a detection of an /jv l- type local primordial non- 
gaussianity (see Eq. (||)), finding f^L = 87 ± 30. This is 
inconsistent with the expectation from single-field slow- 
roll inflationary models that predict a much smaller value 
\Inl \ 0.1, and might be seen as evidence for new ekpy- 
rotic scenario which predicts Jnl ~ 10 — 100 ||. How- 
ever, the more recent data release of 5 year WMAP maps 
shows a smaller value of Jnl = 51 ± 30 0, which is now 
consistent with the minimal inflationary expectation at 
the 2cr level. It is thus clear that an accurate determina- 
tion of the /jvl parameter can have important implica- 
tions for the early universe models. The next generation 
of all-sky CMB surveys, namely the Planck satellite, is 
expected to have enough sensitivity to reduce the error 
on Inl to ~ 3 (e.g. H). However, CMB secondary 
anisotropies (such as Sunyaev-Zel'dovich effect) are ex- 
pected to contaminate the CMB at this level, and thus 
prevent a more accurate measurement. 

Can we expect complementary (and competitive) con- 
straints from the study of large scale structure at low 
redshifts? This question was traditionally addressed by 
looking at the mass function of galaxy clusters, which was 
expected to be a sensitive probe of non-gaussian initial 
conditions (e.g. see [jl], ^ [l0| and references therein). 
Unfortunately, the predicted constraints are not com- 
petitive with the CMB, unless Jml has a stron g sc ale- 
dependence. Recently, Sefusatti and Komatsu Q ar- 
gued that the analysis of the bispectrum of upcoming 
large-volume galaxy surveys can yield competitive con- 
straints with the CMB, but their method relies on the ac- 
curacy of a local bias model for the distribution of galax- 
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ies on small scales. Interestingly, Dalai et al. jjj recently 
predicted the presence of an anomalous large scale power 
in galaxy distribution as a possible smoking gun for local 
primordial non-gaussianity (also see (|, |l2||). This effect 
would only be seen on very large scales, and thus cannot 
be confused with any other causal astrophysical effect 
(although Galactic contamination or large-angle survey 
systematics could give rise to a similar spurious power). 
Shortly before the submission of this paper, Slosar et al. 
jfj| published a comprehensive study of this effect in the 
present large-scale cosmological galaxy surveys, leading 
to a significant constraint: /jvx = 28±25. We will briefly 
comment on this result in Sec. VI. 

In this paper, we present a fully analytic approxima- 
tion that could recast the statistics of collapsed objects 
(i.e. galaxies, clusters, etc.) with local primordial non- 
gaussian initial conditions, in terms of the same statis- 
tics with gaussian initial conditions but a modified power 
spectrum. In Sec.|n], we introduce the approximation 
that recasts the non-gaussian linear density field in terms 
of a linear combination of gau ssia n fields, for collapsing 
objects of a given mass. Sec. Ill applies this result to 
the elliptical collapse framework, and compares the re- 
sulting mass function with numerical simulations. We 
then discuss the implications for the large scale clus- 
tering of galaxies (due to a non-local bias correction) 
in Sec.tV, and show that it could yield competitive (or 
even superior) constraints (A/jvx ~ 0.1), in comparison 
with the best upcoming CMB constraints. Finally, Sec.|v| 
shows that cross-correlation of CMB with galaxy surveys, 
due to its sensitivity to the large scale galaxy bias, can 
similarly put strong constraints on the primordial non- 
gaussianity which are also competitive with the CMB 
(A/jvi, ~ 3), and possibly less sensitive to the galaxy 
survey systematics. Current cross-correlation data be- 
tween the WMAP CMB maps and the NVSS radio galaxy 
survey already show a ~ 2a hint for such non-gaussian 
signal. 

Unless mentioned otherwise, we use the WMAP 5 best 
fit ACDM flat cosmology with fi m = 0.279, n b = 
0.0462, n s = 0.96, h = 0.701 throughout. For all our nu- 
merical calculations, we use the Eisentein & Hu analytic 
approximation [14]] for the matter transfer function that 
includes baryonic features, although we show the simpler 
BBKS, dark matter only, analytic fit [jlB] in Eqs. (g) & 
( |l2| ) , in order to illustrate the asymptotic structure of the 
transfer function. Finally, all the errors on /jvx predicted 
or quoted in this paper are at 68% (la) confidence level. 



II. NON-GAUSSIAN LINEAR 
PERTURBATIONS 



The local primordial non-gaussianity has the following 
form on super- horizon scales: 



which was first introduced i n 061, although similar for- 
mulations were used in jL7|, |l8|, [19], [20| . Here <& is the 
longitudinal (or conformal Newtonian) metric perturba- 
tion: 
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(l + 2$)dt 2 -a 2 (t)(l-2<l>)dx l dx\ (2) 



and $ p g is a statistically homogeneous random gaussian 
field with a nearly scale-invariant power spectrum. On 
sub-horizon scales, and during the matter era, the New- 
tonian potential $ is suppressed by the transfer function, 
T(k), and the metric growth factor g(t) (normalized to 
unity forl«z« 10 3 ) @: 



$ k = T(k)g(t)^ 



super- 



(3) 



The transfer function can be well approximated by the 
BBKS fitting formula p|: 



T(k = gn m /i 2 Mpc _1 ) 
[1 + 3.89g+ (16. 2g) 2 



^ ln[l + 2.Mq] 
2.34<7 X 

(5.47g) 3 + (6.71g) 4 ] 



(4) 
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while g(t) only becomes important at the onset of dark 
energy domination. 

Let us define S m via the Poisson equation 



k 2 $u 



AnGa 2 p n 



-A-\k;t)$ k , (5) 



where 



A(k;t) = -QmH^t)- 1 ^ 2 . 



(6) 



While S m reduces to the matter overdensity (= Sp m / p m ) 
in the Newtonian limit (k aH), the correspondence in 
the k < aH limit will be gauge-dependent, and generally 
does not hold. However, the determining factor in the 
dynamics of a spherically collapsing region is its density 
in units of the local critical density f2tot- This can be 
calculated in longitudinal gauge and within the matter 
era: 
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This justifies our definition of S m in Eq. (g), i.e. only 
the V 2 <!> term (and not the full matter overdensity) is 
relevant for the statistics of collapsed objects. 

Combining the above formulae, we find the following 
expression for the non-gaussian 6 m perturbations: 



f NL f d 3 k' A(\k--k'\;t)A(k';t)T(k) 



(8) 



g(t) J (27T) 3 T(\k-k'\)T(k')A(k;t) 
where 



'mG,k-k'^mG,k 



$(x) 



$ pG (x) - f NL [$ 2 G (x) - ($ 2 G (x))] , (1) 



-A- 1 (fc;t)T(%(t)$ pGjk , 



(9) 
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is the gaussian density perturbation (if /jvx = 0). In 
order to find a local form for S m in real space, let us 
consider the approximation: 

A(\k-k'\;t)A(k';t)T{k) 



T(\k-k'\)T(k')A{k;t) 
A(k';t) A(|k-k'|;t) A(k;t) 



(10) 



T(fc') T(|k-k'|) T(k) ' 

Assuming k, k' 3> k eq (associated with collapsed objects 
today), notice that for large k's A(k;t)/T(k) only loga- 
rithmically depends on k. Therefore, since Eq. ( |l0| ) holds 
for constant A/T,it must be a good approximation in the 
large k limit. The only exception to this limit is when 
k' -C k eq (|k — k'| <C k eq ), for which the first (second) 
term on the right hand side dominates (A/T cx k~ 2 for 
small k's). However, since the two other terms are still 
slowly varying, Eq. ( jlpj ) remains a good approximation. 
In fact, one can numerically confirm that Eq. ( JTo| ) re- 
mains accurate at the percent level for k' < k. 

Plugging Eq. (|l0|) into Eq. (^), we arrive at the fol- 
lowing local approximation for the linear non-gaussian 
overdensity in the real space: 
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S m (x;t,M) ~ S mG (x;t,M) 
2% G (-x;M)6 mG (x;t,M)- 



' fNL X 

<M) 
ag 



6 mG (x;t,M)Y 



(11) 



where, for collapsed haloes of mass M, we assumed k/ir 
R-^M) = [47rp m /(3Af)] 1 /3, and thus 

e(M v a(t)A(k;t) _ 1.9 X10- 5 



FIG. 1: The non-gaussian correction to o(M), the effective 
amplitude of density fluctuations for collapsed objects of mass 
M. The solid, dotted, and dashed lines are for z = 0,1,5 
respectively, while we assumed Jnl = 100. The vertical line 
shows Ms, the mass within comoving 8/i _1 Mpc. 



mass M) exceeds the mass-dependent collapse threshold 
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with 



T(k) ~ {Qr, 
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1/4 <5 ec (M) ~ g 1 / 2 ^ [1 + l3{qv)-<] , i/ = 



a(M) 



(13) 
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A second approximation that we made in writing 
Eq. ( pr| ) was to ignore the contribution of gaussian 
modes with wavelength smaller than the mass filter (fc > 
ir/R(M)) to the quadratic term. This is justified as the 
power per mode drops as ~ fc~ 3 on small scales, and thus 
the constructive interference of short wavelength modes 
in the quadratic term cannot significantly change the 
density averaged over the mass filter. In other words, 
the contribution of the Fourier modes to the total power 
in the non-gaussian term (averaged over the mass filter) 
is sharply cut off beyond k ~ n/R(M), which justifies us- 
ing the smoothed gaussian fields on the right hand side 
of Eq. (|T 



where (g, /3, 7) = (0.707,0.47,0.615) are fitted to simula- 
^ tions p2| , and 5 SC ~ 1.68 is the spherical collapse thresh- 
old. 

For gaussian initial conditions, numerical simulations 
indicate that, to a good approximation, the mass func- 
tion (or number density) of collapsed objects of mass M 
is simply a function of a(M)/5 ec (M), where a 2 (M) = 

<<^ G (x;i,M)>. 

On the other hand, Eq. ( p4| ) shows that, for the non- 
gaussian initial conditions, the local overdensity depends 
on the local values of both S mG and $ p g- T° linear order 
in Jnl, the elliptical collapse condition reduces to: 



2fNL5ec%G > Sec + (^) 



5 m > S ec 5 mG 
We can now define S mG and S ec as 



ag 



III. STATISTICS OF COLLAPSED OBJECTS 

In order to estimate the impact of non-gaussianity on 
the statistics of collapsed ob jects, we will use the ellip- 
soidal collapse framework [El], where the collapse crite- 
rion is that <5 m (x; t, M) (smoothed with a top-hat filter of 



^fNhSec^pGi 



el 



NL , 2 



ag 



(15) 



We thus find that, as 6 mG is a gaussian random field, 
we could still use the old gaussian framework, provided 
we replace S ec by a slightly modified collapse threshold 
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5 ec . Moreover, as the mass function only depends on the 
ratio a(M)/5 ec , we have: 



or(M) _ (5mG) 1/2 _ &{M) 



(16) 



where 



a(M) = d{M)f 
d K 



I 1/2 



<t 2 (M) - 4:f NL 5 ec ($ pG S mG )] 



1 o e 

ag 



a(M) 



1 - 



ag V ° 2 (M) 



(17) 



We also note that: 
a 2 (M) - 

ag($ pG 6 mG ) -- 



d 3 k 
(2^)3 

d 3 k A(k) 

(27T) 1 T(fc)' 



P(fc)VF 2 (fci?), (18) 
P(/c)J-F 2 (fci?),(19) 



where, VF(/cP) is the Fourier transform of the spherical 
top-hat filter of radius R: 



W(x) 



3 (sin a; 



x cos x 



(20) 



and 47ri? 3 p m /3 = M, while e was defined in Eq. (|l^) . 

Fig. |l]) shows the non-gaussian correction to a(M) for 
/jvl = 100 as described in Eq. (|i~7|), for three different 
redshifts. We first notice the increasing trend with mass, 
i.e. the effective non-gaussian amplification of the power 
spectrum is more significant for larger haloes/clusters. 
Secondly, we notice the increasing trend with redshift, 
which simply results from the inverse scaling with ag in 
Eq. (0). The change in the shape of the correction at 
different redshifts, however, is a result of the mass depen- 
dence of the elliptical collapse threshold (Eq. |l3|), which 
kicks in below the (redshift-dependent) non-linear mass 
scale. 

The vertical line shows Mg, the mass within comoving 
8/i _1 Mpc, which is often used to normalize the ampli- 
tude of linear density fluctuations: <jg = <r(Mg). For our 
assumed cosmology, we find approximately 
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0.137 
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0.137 
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where Aer 8 and An 8 are corrections to the effective nor- 
malization and tilt of the matter power spectrum at 
8/i _1 Mpc, if it is deduced from the statistics of collapsed 
objects at redshift z, using gaussian predictions. Here, 
D{z) oc ag is the linear density growth factor, which is 
normalized to unity ar z = 0, and we used the definition 
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FIG. 2: The relative change in the cluster mass function for 
/nl = 100. The solid (black), dotted (red), and dashed 
(green) lines correspond to our analytic prediction based on 
the a(M) correction (Eq. for 2 = 0, 0.5 and 1 respec- 

tively. The triangles, squares, and pentagons are simulations 
of Dalai et al. (if in the same order. The error bars show 
simple Poisson errors in simulated cluster numbers. 



One implication of this result is that, assuming con- 
stant /nl < 100, one needs to measure erg to better than 
1%, in order to be able to distinguish gaussian and non- 
gaussian models from cluster number counts. However, 
assuming that this is the case, the degeneracy with cts 
can be broken through the redshift evolution of the mass 
function (or the effective power spectrum). 

Fig. (H) compares the predictions of our analytic frame- 
work for the cluster mass function with the simulated 
mass functions with non-gaussian initial conditions. In 
order to find the mass function, we plug the effective 
a(M) (Eqs. (0) or (|l[f|)) into the Warren et al. ana- 
lytic formula ]23[| for the halo mass function, which was 
fitted to ACDM N-body simulations with gaussian ini- 
tial conditions. This result is compared with the mass 
functions directly obtained in Dalai et al. using non- 
gaussian initial conditions. The analytic prediction uses 
the same cosmology assumed in the non-gaussian simu- 
lations 



It appears that our analytic framework can reproduce 
the numerical results in the large mass regime (when the 
corrections are > 10%), but overpredicts the mass func- 
tion by up to ~ 50% at lower masses, although the dis- 
crepancy seems less pronounced at higher redshifts. One 
possible reason for this discrepancy is that the elliptical 
collapse threshold of Eq. (|l^) plj is based on an analytic 
fit to random gaussian fields. Using non-gaussian initial 
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conditions will most likely introduce corrections to the 
elliptical collapse threshold, which could become signifi- 
cant at small masses (although for large masses one still 
expects the spherical collapse threshold S sc ~ 1.68). 

Another possibility is the presence of an unknown sys- 
tematic error (or numerical artifact) in the non-gaussian 
simulations. For example, it is clear from the simulation 
data points in Fig. (||) that the Poisson errors are only 
an underestimate for the true numerical error. More- 
over, it appears that Dalai et al. simulations Q predict 
a lower non-gaussian correction to the mass function, in 
comparison with independent simulations of Grossi et al. 

Alternatively, one may argue that such a discrepancy 
at low masses, even if real, may be of little consequence 
as non-gaussian corrections are only large (and thus ob- 
servable) for massive clusters. Nevertheless, more thor- 
ough numerical studies (including a suite of non-gaussian 
simulations to account for sample variance), as well as 
an extension of the elliptical collapse framework to non- 
gaussian initial conditions should elucidate this issue. 



IV. CLUSTERING ON LARGE SCALES 

The clustering of collapsed objects on large scales is 
often described in terms of the bias parameter, which 
quantifies the ratio of the overdensity of collapsed objects 
to the total matter overdensity: 



mass M is given by: 



/(> M) = F(x); x 



SecjM) = 8ec{M) 

a(M) a(M) ' 



(26) 



and F(x) is a universal function that can be fitted from 
simulations, and roughly has the Press-Schecter form: 



F{x) ~ F PS (x) 



-y 2 /2 



dy. 



(27) 



Similarly, the fraction of mass in the halo mass interval 
(M, M + dM) is given by: 



f(M, M + dM) = -F'(x) 



~5 ec {M + dM) ~S ec {M) 
d(M + dM) u{M) 



(28) 

Now, adding a positive background of Sq (= 5 m G,a — 
2fNL5 ec $> P G,o) to S m a in a large patch of the Universe 
effectively decreases the collapse threshold to S ec — So 
for small scale fluctuations in that region. Therefore, the 
collapsed mass fraction in that region is slightly modified: 



F"(x) 

Sf(M,M+dM) = S l-^d 



~Sec(M) 

a(M) 



F'{x)da~ 1 {M) 
(29) 



Combining Eqs. (|28|) and (|29|), and after simple manip- 
ulations, we find: 



b{k,M) 



(<5m,k<5A-/,-k) 
(^m,k<$m,-k) 



(24) 



where 

Sis,* = Kaio(M))- 1 J d 3 x e - ik - x <5n halo (x, M), (25) 

is the Fourier transform of the overdensity of haloes of 
mass M . 

In the last section, we showed that the statistics of col- 
lapsed objects of mass M, resulting from (slightly) non- 
gaussian initial conditions, can be described in terms of 
the statistics of a new gaussian field S m G, with a modified 
elliptical collapse threshold S ec (Eq. |l5|). In particular, 
the fraction of the mass in collapsed objects larger than 



d\n5 ec {M) 
dlna(M) 



- 1 



-±(M,M+dM) = — { 

/ V ' 5 ec {M) 1 d\nx 

(30) 

which is also known as Lagranigan overdensity. Since 
the mean halo number density is given by p m f(M, M + 
dM), the (Eulerian) halo overdensity with the So back- 
ground takes the form: 



<5halo — S. 



roGCT 



Sec(M) 



d\nF' 
d\nx 



d\n~8 ec {M) 
d\na{M) 



(31) 

Recalling the definition of S, this yields the following 
expression for the (Eulerian) bias: 



b(k,M) = l+~S ec (MY 



2f NL S ec (M)A(k,t) 
g(t)T(k) 



d\nF' 
din ir 



d\n~5 ec {M) 
dlna(M) 



(32) 



(33) 
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FIG. 3: The ratio of the elliptical to spherical collapse thresh- 
olds, as a function of the scale-independent gaussian linear 
bias. The scale-dependent non-gaussian bias is enhanced by 
this factor (Eq. f||. 



FIG. 4: The scale-dependent part of bias b — boo divided by 
boo — 1, for /jv_l = 100 and z = 0. The scale-independent 
part of bias, boo = 1.5 and 3 for solid and dotted curves. The 
dashed line is Dalai et al.'s derivation flfl. 



Here b c 



} (M)=b(M;f NL = 0) + o(£f g 



is the k 



oo limit of the linear bias. For reasonable values of (con- 
stant) /jv£ < 100, similar to cr(M), this would only lead 
to percent level corrections to the gaussian bias, which we 
can safely ignore and assume boo(M) ~ b(M ; Jnl = 0). 
With this assumption, the ratio of the elliptical to spher- 
ical collapse threshold, 8 e c{M)/8 sc , (again ignoring Jnl) 
can also be described as a function of &oo by combining 
Eqs. @, (§7|), and ©. This is plotted in Fig. (g), and 
can be used to estimate the non-gaussian scale-dependent 
bias (second term in Eq. |33| ) as a function of the small- 
scale (or gaussian) linear bias. 

Fig. ([|) shows the scale-dependent part of linear bias 
(the second term in Eq. |33|) as a function of k for = 
1.5, 3 and Jnl = 100, which is compared to the Dalai et 
al. derivation Q). For small k's (k < k eq ), both biases 
scale as k~ 2 , although our prediction is slightly higher by 
the elliptical to spherical collapse ratio (Fig. (g)). How- 
ever, for k > k eq the non-gaussian scale-dependent bias 
factor is further enhanced by the inverse of the transfer 
function, which was missing in Q (and was also recently 
pointed out by ||). 

The unique aspect of this non-gaussian correction to 
the galaxy bias is that, rather than going to a constant on 
large scales, it blows up as k — ► 0. This is due to the fact 
that, on large scales, 8 m G is dominated by the $ p g term 
(see Eq. [l5]), and thus, the galaxy distribution follows the 
primordial curvature perturbations rather than the mat- 
ter density. An easy way to understand this phenomenon 
is illustrated in Fig. (0), which shows how modulation 




FIG. 5: This figure illustrates the contrast between the gaus- 
sian and non-gaussian halo/galaxy bias. The two plots show 
cartoon versions of linear density vs. spatial position. The 
white shaded areas indicate collapse regions. For gaussian 
initial conditions, different Fourier modes are uncorrelated, 
and so long wavelength modes (thin black curve) only change 
the background local mean value of small scale modes (thin 
white curves), which in turn changes the number of density 
peaks that cross the collapse threshold (thick white line) and 
form haloes. However, for non-gaussian initial condition, the 
long wavelength modes can also modulate the amplitude of 
small scale modes, which causes an additional modulation of 
collapsed halo density. 
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of the amplitude of small scale density perturbations by 
large scale modes (caused by primordial non-gaussianity) 
can lead to an additional non-gaussian bias. 

The peculiar scale-dependence of the non-gaussian bias 
correction differentiates it from any causal astrophysical 
effect, where bias should go to a constant on large scales 
(beyond the sound horizon ~ few Mpc's). Therefore, this 
effect can provide us with a smoking gun for primordial 
non-gaussianity, through observations of large scale clus- 
tering in galaxy surveys |]l[ |). In fact, Dalai et al. |l| 
claim that next generation of large scale galaxy surveys 
can yield A/jvl < 10. In Fig. (g) we illustrate this po- 
tential for a galaxy redshift survey with a given number 
of galaxies N ga i, fraction of sky coverage f s ky, and max- 
imum redshift z max . Here, for simplicity, we assume a 
uniform comoving galaxy number density, and b ga i — 2 
for the Gaussian bias. In order to compute the sensi- 
tivity of the observation of galaxy power spectrum to 
an /wL-type primordial non-gaussianity, we consider the 
gaussian x 2 '- 



dV(z) 



d 3 k \S gal (k)\ 2 
kmin (2tt -f P gal (h,z) 



lll[Pgal{k, 



where P ga i(k, z) is the power spectrum of galaxy distribu- 
tion, which, in the linear regime, is related to the matter 
power spectrum, P(fc, z), through bias and the comoving 
galaxy number density: 



P g ai(k, z) = b 2 gal {k, z)P(k, z) + nj v 



(35) 



b ga i(k) is given by Eq. (|33|), which is parameterized by 
/jvl and boo. Assuming that these parameters remain 
constant across the survey, they can be constrained ob- 



servationally through minimizing the x m Eq. (34) 



Marginalizing over b^ leads to the la error predictions 
(AjWs) shown in Fig. (§). 

We should note that as the non-gaussian bias correc- 
tion grows as fc -2 on large scales, almost all the sta- 
tistical power for constraints on Jnl comes from the 
largest wavelength modes in the survey, i.e. roughly 
A/at^ oc (kmin/V) 1 / 2 , where V is the volume of the sur- 
vey. While, for our predictions, we use the rough redshift- 
dependent estimate: 



.CO 



2tt 



[<lTTf sky r(z) 



2ll/2 ' 



(36) 



the actual constraints will depend on the large scale ge- 
ometry of the survey. We also assume k max to be the 
rough scale above which the linear bias framework can- 
not be used reliably: 



2tt 2 



1. 



(37) 



This choice also becomes important for marginalization 
over &00. 

Finally, for completeness, we should point out how the 
results of Fig. @ can be translated for different values of 



(small-scale gaussian) galaxy bias. It is easy to see that 
if we define: 

f NL = (1 - b^ l )f NL , Ng a i = b^Ngai, (38) 

the observational constraints on f^L will only depend on 
N ga i and the survey geometrical properties, but will be 
independent of the actual value of boo ■ As an exam- 
ple, if the mean bias of the galaxy sample is 1.5 rather 
than 2, the predicted errors will be 50% larger than those 
in Fig. (||) for the same survey geometry. However, to 
achieve this, the survey should also have more galaxies 
by a factor of (4/3) 2 = 16/9. 

To summarize this section, we have shown that cluster- 
ing of galaxies on large scales could provide a very sensi- 
tive smoking gun for (local) primordial non-gaussianity. 
For a (futuristic) deep enough galaxy survey, and in lieu 
of survey/ atmospheric systematics, the constraint could 
be as good as A/nl ~ 0.1 which is an order of magnitude 
better than the best predicted CMB constraints (and is 
consistent with recent prediction of jl2| ) . We will discuss 
more immediate future prospects in Sec. VI. 



V. NON-GAUSSIANITY AND THE 
INTEGRATED SACHS- WOLFE EFFECT 

In the last section, we showed that observations of 
clustering of galaxies on large scales could potentially 
lead to very sensitive constraints on a local primordial 
non-gaussianity. However, in order to be competitive 
with upcoming CMB constraints, the selection strategy 
of the survey needs to be uniform to 1 part in 10 5 on 
the largest angles (which are most sensitive to the non- 
gaussian bias). Otherwise, the correlation function will 
be dominated by the survey systematics, which can lead 
to a false detection of positive non-gaussianity. 

While such survey calibration seems daunting by to- 
day's standards, an alternative method that is not prone 
to such systematic errors is to use cross-correlation of the 
galaxy survey with an already well-calibrated large scale 
structure survey, i.e. the CMB sky. Although most of the 
CMB anisotropies on large angles were originated at the 
last scattering surface, a small fraction of them were gen- 
erated after the onset of dark energy domination through 
the Integrated Sachs- Wolfe (ISW) effect ffi§. It was pro- 
posed that the ISW effect could be detected through 
cross-correlating CMB with the galaxy surveys |2C], |2?], 
pg| , and its subsequent detection in cross-correlation of 
WMAP maps with various galaxy surveys has been inter- 
preted as i ndep endent evidence for the presence of dark 



38, 391. How- 



energy p, £U|, |3J, £2|, £3] |3J, £5] |3J, 

ever, this analysis should also be sensitive to galaxy bias 
on large scales, and therefore (as also pointed out in (lj) 
could be a sensitive probe of primordial non-gaussianity 
through the non-gaussian bias correction of the previous 
section. In fact, both the non-gaussian contribution to the 
galaxy distribution and the ISW effect follow the New- 
tonian potential on large scales, and therefore should be 
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FIG. 6: The expected precision for the measurement of local primordial non-gaussianity, using the auto-power spectrum of 
a galaxy survey (in lieu of any survey systematics) . The contours show the expected precision as a function of the survey 
sky coverage, f s ky, and the number of galaxies in the survey, N ga i (assuming uniform comoving galaxy number density and 
bgal = 2), for Z m ax = 1, 3, 5. 



well correlated. In this section we predict the potential of 
this method for constraining primordial non-gaussianity. 

In the linear regime, the perturbations in the number 
density of galaxies of a redshift survey projected onto the 
sky can be written as a line of sight integral over the total 
matter overdensity: 

5 gal (h) = N- 1 J drr 2 n gal (r)b gal (r)5 m (rn-t), (39) 

where n ga i{r) is the comoving mean number density of 
galaxies, b ga i(r) is the average bias for the galaxy distri- 
bution, and N is the mean number of galaxies per stera- 
dian. The secondary anisotropy induced in the CMB due 
to the ISW effect is given by: 



JISW 



(n) = -2 / dr -$(rn;t), 

J dr 



(40) 



where g(t) is again the growth factor for the Newtonian 
potential. The angular cross-power spectrum of the pro- 
jected galaxy distribution and the ISW effect is well de- 
scribed by the Limber approximation ||33[: 



(41) 



C{tj g al,ISW = 

i f , d\ng . , . . { 1+1/2 
-2AT 1 / dr-^-n gal {r)b ga i{r)P^ m f — -i-;t 



where P$, m (fc;t) is the 3D cross-power spectrum of the 
Newtonian potential with the comoving matter overden- 
sity. 

As we argued in the previous section, a local primordial 
non-gaussianity results in a scale dependent bias factor, 
b ga i{k, r), where k ~ ^ + ^ //2 in the Limber approximation. 
Since the non-gaussian bias correction (Eq. 3^) grows as 
k~ 2 on large scales, it can dominate the gaussian bias for 
Jnl ^ 1 on scales smaller than the horizon. Therefore, 
the cross-correlation of the CMB and galaxy distribu- 
tion can be a sensitive probe of Jnl on large angles. As 



0.4 



C\3 



a. 




FIG. 7: Cross-power spectrum between NVSS radio galaxy 
survey and the CMB sky. The data points are from Ho et al. 
| p8[ . The dashed line shows the expectation from WMAP5 
concordance cosmological model assuming a constant bias, 
while the solid line uses the best (unmarginalized) fit non- 
gaussian bias to the data, which corresponds to f ml = 236 ± 
127. 



an example, Fig. ([?]) compares the expected cross-power 
spectra between NVSS radio galaxy survey and the CMB 
sky. The data points are the recent measurements from 
Ho et al. (H 



The dashed line shows the expectation 
from WMAP5 concordance cosmological model and the 
redshift distribution provided in |38|, assuming a con- 
stant bias. The solid line uses the best (unmarginalized) 
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FIG. 8: Optimum full-sky signal-to-noise for the detection of 
a possible }nl through cross-correlation of a full-sky redshift 
survey (assuming b ga i ~ 2) with the CMB sky. The four 
curves correspond to the maximum redshift of the survey, 
z m ax = 0.5, 1, 2, and 5. 



fit non-gaussian bias to the data, which corresponds to 
f NL = 236 ± 127. 

While this result can only be cautiously interpreted 
as a 2cr hint for a local primordial non-gaussianity, we 
notice that it has the same sign (and comparable magni- 
tude) as the current 2 — 3<r evidence for primordial non- 
gaussianity from WMAP CMB maps (see Sec. |), even 
though they correspond to different scales, eras, and mea- 
surement methods. 

Finally, we can predict how well one can measure the 
local non-gaussianity from cross-correlation of CMB with 
future galaxy surveys. To do this, we follow the method 
outlined in [ f40|] , where we divide up the galaxy survey 
into separate spherical shells and add up the signal-to- 
noise for the detection of a given Jnl in each shell in 
quadratures. The error for this measurement is propor- 
tional to the clustering of galaxies, and so the correlation 
between measurements in different shells can be safely ig- 
nored on large scales (at least for small Jnl)- Within this 
approximation, the optimum signal-to-noise for a (full- 
sky) detection of a given Jnl is given by: 



dr(z) ( d\ng\ 2 ( Ab ga i(k,z) 



(42) 



p2 



r{z) 2 \ dr J \ b ga i(k,z) ) P{k,z) 
where Ctt (£) is the angular auto-power spectrum of the 



CMB, and P(k, z) is the linear matter power spectrum. 
Moreover, b ga i and Ab ga i (oc /nl) ar e the total bias and 
its non-Gaussian correction (obtained in Eq. [33|) respec- 
tively. As in the Limber approximation, k ~ e+1 / 2 m 
Eq. @. 

Fig. (|8|) shows the predicted signal-to- noise (from 
Eq. (|43|)) for the detection of primordial non-gaussianity 
through cross-correlation with the CMB for a full-sky 
redshift survey that extends out to z ma x = 0.5,1,2, 
and 5, where we assumed the small-scale gaussian bias 
bg a i — 2 for concreteness. Therefore, we predict that 
cross-correlation of CMB and large scale structure sur- 
veys can eventually be competitive with the CMB surveys 
in providing independent constraints on local primordial 
non-gaussianity at the level of A/jvl ~ 3. 

In this calculation, we have ignored the Poisson noise 
(due to a finite number of galaxies) or other possible sur- 
vey systematics. However, we should note that since 
these errors do not contribute to the ISW signal |fi6| , 
they cannot cause a systematic bias in Jnl determina- 
tions from ISW-galaxy cross-correlation analyses. Nev- 
ertheless, Poisson noise/survey systematics can increase 
the random error in cross-correlation, which is manifested 
through increasing the apparent auto-correlation of the 
galaxy survey. Since we have already calculated the sen- 
sitivity of the galaxy auto-correlation function to /jvl 
in the last section (Fig. (0)), we can say that assuming 
Inl ^ 3, for Ng a i > 10 7-8 the galaxy auto-correlation 
will become roughly insensitive to the Poisson noise on 
large scales (where the non-gaussian signal dominates). 
Therefore, the determination of Jml from ISW-galaxy 
cross-correlation will not significantly suffer from Pois- 
son noise, as long as the number of galaxies in a (full-sky) 
survey is larger than 10 7-8 . Assuming that survey sys- 
tematics also look similar to Poisson noise on large angles, 
this also implies that the calibration of the g alax y survey 
selection function must be better than 1/VlO 8 = 10~ 4 , 
in order not to affect the sensitivity of non-gaussianity 
measurements. 



VI. CONCLUSIONS AND FUTURE 
PROSPECTS 

In this paper, we have studied the impact of of a local 
primordial non-gaussianity (expected from super-horizon 
evolution in multi-field inflationary models or the new 
ekpyrotic scenario) on the statistics of collapsed objects 
in the late universe. We developed an approximation 
that describes the linear non-gaussian density field as 
a local bilinear combination of gaussian fields. Apply- 
ing this to the elliptical collapse framework enabled us 
to find an analogue gaussian linear density field, which 
should produce the same predictions as the non-gaussian 
system for collapsed objects of a given mass, assuming 
small primordial non-gaussianity. For halo mass func- 
tion, this prescription leads to an effective analytic cor- 
rection to a(M), the amplitude of linear fluctuations on 
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mass scale M, which can be readily applied to Press- 
Schechter-like analytic approximations for the mass func- 
tion. We found this to give roughly consistent results 
with direct simulations with non-gaussian initial condi- 
tions. We then focused on the impact of non-gaussianity 
on the clustering of galaxies, and confirmed the recent 
discovery of a non-local bias correction associated with 
local non-gaussian initial conditions || . We found that 
auto-correlation of galaxy surveys can ultimately lead to 
a precision A/jvl ~ 0.1 which is an order of magnitude 
better than the best anticipated CMB constraints. Cross- 
correlation of CMB and galaxy surveys (through the ISW 
effect), even though suffering from a low signal-to-noise 
due to primary CMB contamination, is less prone to sur- 
vey systematics, and can also yields A/jvl ~ 3, which 
is again competitive with future CMB constraints. In- 
terestingly, the cross-correlation of CMB and the NVSS 
radio galaxy survey already provides a 2a hint for a large 
primordial non-gaussianity /jvx = 236 ± 127. 

Finally, we should comment on the future devel- 
opments of the studies of primordial non-gaussianity. 
While five more years of WMAP observations can only 
marginally improve the current constraints to A/jvl ~ 
15 — 20, the most anticipated observational improve- 
ment is the first data release of Planck satellite, expected 
in 2012. After careful accounting for CMB secondary 
anisotropies, as well as the 2nd order corrections to the 
CMB Boltzmann solvers, Planck is expected to achieve 
A/jvl ~ 3. However, as we demonstrated in this pa- 
per, competitive or even superior constraints can come 
from the study of clustering of large scale galaxy surveys. 
For example, Large Synoptic Survey Telescope (LSST) is 
expected to survey half of the sky down to magnitude 
r ~ 27.5 finding ~ 10 10 galaxies out to z max ~ 2. Using 
Fig. (H), LSST can yield a precision of Af^L ~ 2 — 3 
through its galaxy auto-correlation function. The cross- 
correlation of LSST with the CMB sky could indepen- 
dently yield Af^L ~ 7. Similar proposed or underway 
deep, large area surveys such as Pan-STARRS, ADEPT, 
CIP, etc. could potentially lead to comparable con- 
straints. However, one important point to note is that 
precise spectroscopic redshifts are not required for non- 
gaussianity determination, as most of the non-gaussian 
signal comes from the largest scales of the survey. 

Shortly before the submission of this paper, Slosar et 
al. |L3| published a comprehensive study of this effect in 
the present large-scale cosmological galaxy surveys, lead- 
ing to a significant constraint: /jvl = 28 ±25. Nearly all 
the statistical weight of this constraint is based on the 
lack of anomalous power in the lowest 1-bin of the auto- 
power spectrum of a high redshift sample of ~ 10 5 SDSS 
photometric QSO's, which cover 15% of sky and have an 
effective mean bias of ~ 2.75. Plugging these numbers 
into the error estimates of Sec. IV (Fig. we would 



predict an error of about a factor of 2 worse than what 
was found in |0|. As we pointed out in Sec. IV, the 
actual error on /jv l will be sensitive to the survey geom- 
etry, and so this disagreement between our rough error 



prediction and Slosar et al.'s findings may not be an im- 
mediate cause for alarm. Nevertheless, it is curious that 
despite the lack of any redshift information (which was 
used in our error prediction), as well as possible contam- 
inations/survey systematics, Slosar et al. could achieve 
an accuracy in f^L comparable or better than an op- 
timistic error prediction. A further concern is that the 
correlated Poisson noisef typically expected from non- 
linear clustering, e.g. pji) is not marginalized over in 
their analysis. 

On a different note, Slosar et al. point out that a de- 
generacy between f^L and the redshift and bias distri- 
bution of NVSS survey (which are both poorly known), 
hinders any accurate determination of fjsiL from NVSS 
data, or its correlation with the CMB. While, at face 
value, this might seem inconsistent with our ~ 2er evi- 
dence for non-gaussianity from the correlation of NVSS 
with WMAP, we should note that both redshift and bias 
distributions may eventually be fixed, either from theo- 
retical modeling, or direct observations, which can then 
realize the full statistical power of cross-correlation con- 
straints. 

On the theoretical front, we are still far from a good 
theoretical understanding of non-linear structures from 
non-gaussian initial conditions. While the present work 
provides a bridge between this problem, and the huge 
body of work on non-linear/collapsed structures from 
gaussian initial conditions (such as the semi-analytic 
frameworks, the halo model, as well the N-body/hydro 
numerical simulations), it does suffer from some sim- 
plifying assumptions. The first is the ellipsoidal col- 
lapse scenario ||l], [22]. Although for large masses and 
fNLe(M) <C 1 (i.e. near-gaussian initial conditions), the 
collapse should be well-described as spherical (leading 
to a universal spherical collapse threshold), for smaller 
masses, the threshold depends on the ellipticity of the 
proto-halo, which is a random number with a distribu- 
tion set by the statistics of the initial conditions. The 
current analytic form (Eq. |l3|) is a fit to random realiza- 
tions of gaussian fields with ACDM power spectrum, and 
thus there is no reason to exclude corrections linear in 
/jvl, if the linear density field obeys non-gaussian statis- 
tics. Therefore, one clear direction for improvement is to 
investigate/extend the ellipsoidal collapse framework for 
non-gaussian initial conditions. 

The second simplification (and probably the most non- 
trivial part of this work) is the local approximation of 
Eq. (|l l|) , which was instrumental in all the subsequent 
analysis/predictions. While the consistency of our re- 
sults for mass function and clustering with other numeri- 
cal and analytic studies jj], ||] verifies the accuracy of this 
approximation, Eq. (jll]) could also be viewed as a conve- 
nient parametrization for the primordial non-gaussianity. 
To see this, note that for an /jvx-type non-gaussianity 
(Eq. [l]), the bispectrum of <&, B(k±, fe, fc 3 ), is written in 
terms of the power spectrum of the Gaussian field & p g, 
Pp G (k): 

B(k u k 2 ,k 3 ) = -2f NL P pG (k 1 )P pG {k 2 )P p G(k3) x 
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PpG + ^(k) + Pjih) , (43) 



pG 



pG' 



where P p a(k) cx fc~ 3 for scale- invariant spectra. While 
our local approximation does not exactly preserve this 
form, different observables are mostly sensitive to a lim- 
ited region in the fci,fc2j&3 space, where Eq. ([ll]) may 
provide a simple parametrization. For example, it is rea- 
sonable to think that measurements of mass function 
of haloes of mass M would only be sensitive to non- 
gaussianity around k\ ~ k 2 ~ &3 ~ n/R(M) (see the 
line preceding Eq. |l2|). Therefore, constraints on f NL 
using mass function could be roughly translated into con- 
straints on the bispectrum for near- equilateral triangles 
of the side - %/R(M). 

A more interesting case is the clustering (or the power 
spectrum) of c ollap sed haloes (e.g. galaxies) that we 
studied in Sees. |lV[-|v|. In this case, we are dealing with a 
hierarchy of scales: k% <C fei, fe, where k\ -C 0.01 Mpc -1 
is the scale over which we measure the galaxy power 
spectrum, and as we showed pre-dominantly corresponds 
to the modes of the size of the survey. On the other 
hand, ki,k 2 ~ w/R(M) > 0.1 Mpc -1 correspond to 
the comoving size of the collapsing regions. Therefore, 
observations of the galaxy power spectrum provides an 
exquisite measure of the bispectrum for squeezed tri- 
angles. In particular, the non-gaussian bias correction 



A&jvgC^) oc k^ 2 found in Sec. [IV| (as well as in [jl], 
are specific to the local form of non-gaussianity (Eq. 
which is only produced through super-horizon evolution, 



where B(k\, k 2 , fe) &3 — > 0. In contrast, for 

non-gaussianities produced through sub-horizon evolu- 
tion (such as in slow-roll single field inflation, DBI, ghost 
inflation and k-inflation) , B{k\, k 2l k^) — > const, or 
in this limit E2] . Therefore, by way of comparison, the 
scale-dependent bias A&jvg(&3) — > (and so is unob- 
servable) in the long wavelength limit, for models that 
produce equilateral primordial non-gaussianity. 

We thus close this paper by the exciting (although, 
by now, familiar) premise of the cosmological microscope 
as a probe of the fundamental physics of the early uni- 
verse. In particular, that an anomalously large clustering 
of galaxies on large scales will be a unique smoking gun 
for non-linear super-horizon evolution (and thus large 
primordial isocurvature modes) during the early stages 
of cosmogenesis. 
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confused with D(z), the overdensity growth factor, 
which grows as a in the matter era, and is often 
normalized to unity at 2 = 0. 

Here, we have ignored the bias dependence of the 
elliptical collapse threshold (Fig. pi), as it could only 
change the error predictions by 10 — 20%. 
A notable exception is Galactic foregrounds in the CMB 
which could be correlated with Galactic contaminations 
of galaxy surveys. However, similar to the analysis of 
primary CMB anisotropies ^3), the frequency 
dependence of these foregrounds can provide a way to 
clean the cross-correlation signal from Galactic 
contaminations. 



